Rank in Wordlist | Frequency | Word |
---|---|---|
17977 | 5 | 10,000 |
30624 | 2 | 1,748 |
30625 | 2 | 1,765 |
30631 | 2 | 100,000 |
30729 | 2 | 131,136 |
30925 | 2 | 20,000 |
30977 | 2 | 30,000 |
31004 | 2 | 36,000 |
31042 | 2 | 5,510 |
31122 | 2 | 75,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
30615 | 2 | .) |
Rank in Wordlist | Frequency | Word |
---|---|---|
31109 | 2 | 70% |
44178 | 1 | 1.6% |
44202 | 1 | 10.9% |
44906 | 1 | 15.4% |
45260 | 1 | 18% |
45398 | 1 | 19.71% |
45658 | 1 | 2.7% |
45837 | 1 | 24% |
46119 | 1 | 33% |
46159 | 1 | 35% |
Rank in Wordlist | Frequency | Word |
---|---|---|
47325 | 1 | A&E |
47326 | 1 | A&M |
47976 | 1 | CC&C |
50735 | 1 | R&D |
Rank in Wordlist | Frequency | Word |
---|---|---|
3401 | 53 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
5599 | 28 | .' |
31548 | 2 | Mahomet's |
31558 | 2 | Martin's |
31560 | 2 | Mashriqi's |
31642 | 2 | Pakistan's |
31651 | 2 | People's |
45926 | 1 | 26°57'10N |
45957 | 1 | 27°52'10N |
46002 | 1 | 29°48'0N |
46083 | 1 | 30°30'N |
Rank in Wordlist | Frequency | Word |
---|---|---|
31833 | 2 | UTC+5 |
44180 | 1 | 1.6E+14 |
44663 | 1 | 13+11 |
47335 | 1 | AC+793888 |
48575 | 1 | EEZ+TIA |
51611 | 1 | UTC/GMT+5 |
57983 | 1 | ايسٽروجن+پروجيسٽران |
65863 | 1 | خاڻ+وٽ |
85696 | 1 | وقت+01:00 |
Rank in Wordlist | Frequency | Word |
---|---|---|
3116 | 59 | https://www |
13409 | 8 | http://encyclopediasindhiana |
14183 | 8 | هه/ |
17976 | 5 | 1/16 |
20703 | 4 | 5/8 |
20809 | 4 | https://pahenjiakhbar |
24379 | 3 | 1/8 |
24391 | 3 | 11/2 |
24838 | 3 | https://elevation |
24839 | 3 | https://sindhsalamat |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots